CDS

Accession Number TCMCG039C00404
gbkey CDS
Protein Id XP_024017667.1
Location complement(join(64974..65102,65590..65697,65783..66026,66579..66814,66963..67153,67229..67466,68366..68584))
Gene LOC21389362
GeneID 21389362
Organism Morus notabilis

Protein

Length 454aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA263939
db_source XM_024161899.1
Definition uncharacterized protein LOC21389362 isoform X1 [Morus notabilis]

EGGNOG-MAPPER Annotation

COG_category C
Description formamidase C869.04 isoform X1
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00524        [VIEW IN KEGG]
KEGG_rclass RC02432        [VIEW IN KEGG]
RC02810        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01455        [VIEW IN KEGG]
EC 3.5.1.49        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00460        [VIEW IN KEGG]
ko00630        [VIEW IN KEGG]
ko00910        [VIEW IN KEGG]
ko01200        [VIEW IN KEGG]
map00460        [VIEW IN KEGG]
map00630        [VIEW IN KEGG]
map00910        [VIEW IN KEGG]
map01200        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTCAACACGGTCCGCGACTGGTCGTCCCAATCGACGTGAACAAGAAGCCTCGGGAACAGGAGCTTCCCCTCCACAACCGATGGCACCCGGAGATCCCGCCGGTCGCGGAGGTCGCCGCCGGCGAGGTCTTCAGAGTTGAAATGGTGGATTTCAGCGGCGGCGGTATTACCAAGGAATTCACCGCCCATGACATCAAACACGCCGATCCTTATATCGTTCATTATCTCAGTGGGCCTATCAGAATATTGGACAAGGATGGAGCTGCAGCCAAGCCAGGGGATCTTCTGGCGGTTGAGATCTGCAACTTGGGTCCACTTCCAGGAGATGAATGGGGTTACACAGCCACATTCGATCGAGAGAATGGCGGCGGTTTTCTGACCGACCATTTTCCTTGTGCAACCAAAGCTATTTGGTATTTTGAAGGAATATATGCTTACTCTCCTCAAATACCAGGAGTAAGATTTCCGGGTTTAACTCACCCGGGAATAATTGGAACAGCGCCATCAATGGAACTCCTCAATATATGGAATGACAGGGAGAGAGAGCTGGTTGAGAATGGACTCGAGTCTCTGAAACTATGTGAAGTCTTGCATCAACGACCATTGGCCAACTTACCAACAACAAAAGGTTGTGTCCTCGGAAAGATCAAAGAGGGCACTCCTGAATGGGAGAAGATAGCTCAGGAGGCTGCAAGGACGATACCGGGGAGAGAAAATGGTGGAAACTGCGACATAAAAAACCTTAGTAGAGGATCAAAGATATACCTTCCTGTATTTGTAGAAGGAGGAAATCTCAGTACTGGTGATATGCACTTCTCTCAGGGCGATGGTGAAGTCTCATTCTGTGGGGCAATTGAGATGAGTGGCTTTCTGGATCTCAAATGTGAGATTATAAGGGATGGAATGAAAGAGTATCTGACGCCAATGGGGCCAACTCCTCTTCATGTGAACCCAATATTTGAGATAGGACCTGTTGAACCAAGATTCTCAGAATGGCTGGTATTTGAGGGTATAAGTGTTGATGAGAGCGGGAGGCAGCACTATCTCGACGCAACCGTTGCTTACAAGCGTGCAGTACTTAATGCCATTGACTACCTCTCCAAATTTGGATATTCCAAAGAACAGGTCTACCTTCTGTTATCCTGCTGCCCATGTGAAGGGAGGATTTCTGGTATAGTCGATTCCCCCAATGCTGTGGCAACTCTGGCAATTCCAACTGCAATATTTGATCAGGATATTCGTCCAAAAGCCGGCAAAGTGCCAGTTGGACCCCGGCTAGTGAGGAAACCGGACGTCCTGAAATGTAGTTACGATGGAAATTTGCCCACAACTAAGAACCCTTGCTCTAGCTCCACAATCTGA
Protein:  
MAQHGPRLVVPIDVNKKPREQELPLHNRWHPEIPPVAEVAAGEVFRVEMVDFSGGGITKEFTAHDIKHADPYIVHYLSGPIRILDKDGAAAKPGDLLAVEICNLGPLPGDEWGYTATFDRENGGGFLTDHFPCATKAIWYFEGIYAYSPQIPGVRFPGLTHPGIIGTAPSMELLNIWNDRERELVENGLESLKLCEVLHQRPLANLPTTKGCVLGKIKEGTPEWEKIAQEAARTIPGRENGGNCDIKNLSRGSKIYLPVFVEGGNLSTGDMHFSQGDGEVSFCGAIEMSGFLDLKCEIIRDGMKEYLTPMGPTPLHVNPIFEIGPVEPRFSEWLVFEGISVDESGRQHYLDATVAYKRAVLNAIDYLSKFGYSKEQVYLLLSCCPCEGRISGIVDSPNAVATLAIPTAIFDQDIRPKAGKVPVGPRLVRKPDVLKCSYDGNLPTTKNPCSSSTI